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Abstract: We characterized 14 anonymous nuclear loci from Pinus 
thunbergii Pari., an important pine species native to Japan. One hundred 
and twenty-six single nucleotide polymorphisms (SNPs) were identified 
from these loci, giving a frequency of 1 SNP per 51 bp. Nucleotide di¬ 
versity (0) ranged from 1.06 x 10' 3 to 11.87 x 10°, with an average of 
4.99 x 10' . Only one locus (mK45) deviated significantly from the 
Hardy-Weinberg equilibrium. Thirteen of 14 loci were applicable in other 
pine species. These loci will be useful for nucleotide variation studies 
and will provide material for SNP-based marker development in P. thun¬ 
bergii and related species. 

Keywords: anonymous nuclear loci; Pinus thunbergii ; SNP; SCAR; 
cross-species transferability 

Introduction 

Pinus thunbergii Pari, is an important pine species, native to 
Japan, which is endangered because of pine wilt disease (Ma- 
miya 1988). To cope with this problem, since 1978, a series of 
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genetic conservation and resistance breeding projects has fo¬ 
cused on selection and creation of resistant individuals (Nose and 
Shiraishi 2008). To assist these projects, genetic studies using 
molecular markers are important. 

Materials and methods 

In this study, we developed 14 anonymous nuclear loci using a 
locus-specific sequence characterized amplified region 
(SCAR)-based approach (Paran and Michelmore 1993). The 
genomic library was obtained by screening the monomorphic 
band produced by 60 random amplified polymorphic DNA 
(RAPD) primers (13-mer) in 16 individuals of P. thunbergii. 
The RAPD-PCR (Williams et al. 1990) reaction was conducted 
in a total volume of 10 pi containing 1 x PCR buffer, 2.0 mM 
MgCl 2 , 0.2 mM of each dNTP, 0.25 U Platinum Taq DNA poly¬ 
merase (Invitrogen), 1.5 pM primer, and 20 ng template DNA. 
The PCR cycles were as follows: 94°C for 1 min; followed by 40 
cycles of 94°C for 30 s, 50°C for 30 s, and 72°C for 1 min; fol¬ 
lowed by a final extension of 72°C for 2 min. The amplification 
product was analyzed by 1.5% agarose gel electrophoresis. A 
monomorphic band with an approximate fragment length of 
300-700 bp was excised from the gel and used as template DNA 
for re-amplification. The re-amplification product was ligated 
into the pGEM-T vector (Promega) and transformed into JM109 
competent cells (Promega). Universal primers -21 Ml3 and 
M13Rev were used to directly sequence both strands of the col¬ 
ony PCR product using BigDye Terminator v3.1 (Applied Bio¬ 
systems) and electrophoresis on an ABI PRISM 3130 (Applied 
Biosystems). An internal locus-specific primer was designed 
from the sequencing results and used for PCR amplification and 
direct sequencing. 

In order to screen for sequence polymorphims, PCR products 
were amplified from between eight to sixteen individuals of P. 
thunbergii (Table 1). The PCR reaction was set up in a volume of 
10 p 1 containing 1 x PCR buffer (Invitrogen), 1. 5 mM MgCl 2 , 
0.2 mM of each dNTP, 0.25 U Platinum Taq DNA polymerase 
(Invitrogen), 0.2 pM primer pair, and 20 ng template DNA. A 
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touchdown PCR program was used in all primers as follows: 
94°C for 1 min; followed by 10 touchdown cycles of 94°C for 30 
s, 62-55°C (decreasing 0.7°C per cycle) for 30 s, and 72°C for 1 
min; followed by 25 cycles of 94°C for 30 s, 55°C for 30 s, and 
72°C for 60 s; followed by a final extension at 72°C for 2 min. 
For the sequencing reaction, the same chemicals were used, as 
described above. Sequencing results were base-called and 
aligned using Sequencher v.4.2 (GeneCodes). 

Homologous sites from each locus alignment that appeared to 
exhibit sequence variation, either in heterozygous or homozy¬ 
gous forms, were evaluated by manual inspection. Nucleotide 
diversity of each locus was quantified as 0 = K/\L* [1 - 1 + 2' 1 + 
3" 1 +... + (n- 1)" 1 ]] (Watterson 1975; Primmer et al. 2002), where 
K is the number of observed polymorphic sites, L is the total 
length of the sequence (in bp), and n is the total number of 
chromosomes screened. Number of chromosomes screened is 
defined that in a diploid organism, it would carries two copies of 


the locus, and thus two chromosomes were screened. The for¬ 
mula corrects for different sequence lengths and number of 
chromosomes screened among loci. 

To test if the SNPs discovered in this study are useful for ge¬ 
netic studies, one of the most polymorphic sites in each locus 
was selected and used for estimation of the observed and ex¬ 
pected heterozygosities (Ho, Ho) and tested for Hardy-Weinberg 
equilibrium. These analyses were done using Cervus v3.0.3 (Ka- 
linowski et al. 2007) and Genepop online version 
(http://genepop.curtin.edu.au/), respectively. 

Results and discussion 

A list of anonymous nuclear loci generated in this study is pre¬ 
sented in Table 1. 


Table 1. Characterization of 14 anonymous nuclear loci, developed from Pinus thunbergii. 


Locus 


Primer sequence (5'-3') 

No. of chro¬ 
mosomes 

screened 

No. of bp 

screened 

No. of 

SNPs 

0 x 10 3 

Heterozygosity 
SNP identity Ho 

He 

Accession 

no. 

mKOl 

F: 

CT G AGT G AAAAGTT ACGGT AG ATT G AAG 

30 

407 

2 

1.24 

174: A/C 

0.200 

0.186 

AB607011 


R: 

CGT GTCTT GT GT ACT GCG AT GT C 









mK03 

F: 

CCCT GAT AGCG AT AAGCG AT AAG AG 

32 

330 

5 

3.76 

139: A/G 

0.375 

0.315 

AB607012 


R: 

CCT GAT CC AACGGTC A AT AT GC 









mK08 

F: 

TGCTTGAAGAAGCTCCATGTTCC 

32 

618 

3 

1.21 

310: C/T 

0.313 

0.272 

AB607013 


R: 

TTTTGTTTTGGTACAGTTCTTTTCTCTG 









mK37b 

F: 

CAT GCT CTT GATT GG ACT GC AC AC 

32 

467 

2 

1.06 

280: A/C 

0.188 

0.272 

AB607016 


R: 

G AAAT GG AGG AGGG AT AAGG AGG 









mK45 

F: 

GGTT GT GG AC ATTT C AGT G A ATT G 

30 

448 

22 

11.83 

304: C/T* 

0.000 

0.515 

AB607018 


R: 

TT GGTT GGCT C ACG AAAGTTCT G 









mK46a 

F: 

GGTT GT GTT ATT GC AT ATCGC AT G 

32 

601 

8 

3.31 

442: A/G 

0.563 

0.466 

AB607019 


R: 

GG AT GAT AC AAAT G AC AAG ACCTCC 









mK48a 

F: 

GTCTGGTTGTTGCAGGTGTGG 

32 

457 

21 

11.41 

362: G/T 

0.250 

0.315 

AB607020 


R: 

ATCAAATTTTCTTACCTTGGCTACCTC 









mK49b 

F: 

C AAG AAG AACT AT GT GCCTCGT GAT C 

32 

481 

24 

11.87 

159: A/T 

0.063 

0.175 

AB607021 


R: 

GTGTCAAAGTATTACGATATTTCAGGTGG 









mK51a 

F: 

CTC AAGC AT AAG AGGTT G A AAT C ACC 

16 

479 

3 

1.89 

308: A/C 

0.000 

0.233 

AB607022 


R: 

AAT GCT C AG AG AT AT GC AAT G AACC 









mK63a 

F: 

TCCTGGTGGTATTTCACTCTTTCG 

32 

417 

4 

2.38 

364: C/T 

0.313 

0.272 

AB607023 


R: 

AAAATT GCAGC AAAAGTT GT G A AG 









mK65a 

F: 

GACAAGCTCGAACCGACATCAA 

32 

405 

3 

1.84 

255: C/T 

0.063 

0.063 

AB607024 


R: 

CGGT CAT GT AATTT AATT GT GCT AAGC 









mK67b 

F: 

TGT GC AATTTTC ACT AGG AGT GTTG 

32 

340 

16 

11.69 

131: A/C 

0.063 

0.175 

AB607025 


R: 

AC AAGCT CG AT CC AG AG AGTT GG 









mK68b 

F: 

GCTACTGTGGATGTAGCAGTTATGG 

32 

443 

3 

1.68 

148: A/G 

0.438 

0.353 

AB607026 


R: 

AAGCTCGC AAC AC AAAG AT CG 









mK71b 

F: 

AACCAATTGCTACCCTTTATACCACTC 

32 

516 

10 

4.81 

199: G/T 

0.500 

0.516 

AB607027 


R: 

CGACAAGCTCGTTCCATAACTCACTC 











Total/ Average 

428 

6409 

126 

4.99 






0, nucleotide diversity; SNP identity, position and alleles of SNP site used for calculation of observed heterozygosity (Ho), expected heterozygosity (Ho), and the 
goodness of fit to the Hardy-Weinberg equilibrium; *, significant deviation (p < 0.05). 


Fourteen sets of locus-specific primers successfully produced 
a PCR product suitable for direct sequencing, i.e., the PCR am¬ 
plification resulted in a single, strong band as visualized by aga¬ 
rose gel electrophoresis. High-quality sequences were obtained 
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from these loci with a total length of 6,409 bp. One hundred and 
twenty-six SNPs were identified, which translates to an average 
of 1 SNP per 51 bp. Nucleotide diversity (0) of the individual 
loci ranged from 1.06 x 10 3 to 11.87 x 10‘ 3 , with an average of 
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4.99 x 10‘ 3 . The Ho ranged from 0.000 to 0.563, whereas He 
ranged from 0.063 to 0.516. Using only the most polymorphic 
SNP site within each locus, we found that one locus (mK45) 
violated the assumptions of the Hardy-Weinberg equilibrium (p < 
0.05). The remaining 13 loci should be regarded as unlinked loci. 

The developed loci were further tested in five pine species to 
confirm their cross-species applicability (Table 2). 


Table 2. Cross-species transferability of 14 anonymous loci in 5 pines 
species. ++, single band in the expected size; +, band in expected size 
with other nonspecific bands; no PCR product. 


Locus 

Subsection Pinus 

Subsection Australes 

Pinus 

luchuensis 

Pinus 

densiflora 

Pinus 

masson- 

iana 

Pinus palus- Pinus taeda 

tris 

mKOl 

+ + 

- 

- 

- 

mK03 

++ 

- 

- 

- 

mK08 

++ 

- 

++ 

- 

mK37b 

++ 

++ 

+ 

- 

mK45 

- 

- 

- 

- 

mK46a 

++ 

++ 

+ 

- 

mK48a 

++ 

++ 

++ 

++ ++ 

mK49b 

- 

++ 

- 

- 

mK51a 

++ 

++ 

++ 

++ ++ 

mK63a 

++ 

- 

- 

- 

mK65a 

++ 

++ 

++ 

- 

mK67b 

++ 

- 

++ 

- 

mK68b 

++ 

++ 

++ 

- 

mK71b 

++ 

++ 

++ 

- 


Four individuals from each pine species were used for the test. 
In 13 of 14 loci, a single PCR product of the expected size was 
successfully amplified in at least 1 species. In both P luchuensis 


and P. densiflora , seven loci were applicable, a result that was 
encouraging for further use of these loci in experiments to study 
the comparative nucleotide variation among these pines. The 
SNPs discovered in this study also provide important material for 
SNP-based marker development in P. thunbergii and related pine 
species. 
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